Mining Email Data

نویسنده

  • Steffen Bickel
چکیده

E-mail has become one of the most important communication media for business and private purposes. Large amounts of past e-mail records reside on corporate servers and desktop clients. There is a huge potential for mining this data. E-mail filing and spam filtering are wellestablished e-mail mining tasks. E-mail filing addresses the assignment of incoming e-mails to predefined categories to support selective reading and organize large e-mail collections. First research on e-mail filing was conducted by Green and Edwards (1996) and Cohen (1996). Pantel and Lin (1998) and Sahami, Dumais, Heckerman, and Horvitz (1998) first published work on spam filtering. Here, the goal is to filter unsolicited messages. Recent research on e-mail mining addresses automatic e-mail answering (Bickel & Scheffer, 2004) and mining social networks from e-mail logs (Tyler, Wilkinson, & Huberman, 2004). In Section Background we will categorize common email mining tasks according to their objective, and give an overview of the research literature. Our Main Thrust Section addresses e-mail mining with the objective of supporting the message creation process. Finally, we discuss Future Trends and conclude.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Study to Improve the Response in Email Campaigning by Comparing Data Mining Segmentation Approaches in Aditi Technologies

Email marketing is increasingly recognized as an effective Internet marketing tool. In this study, a questionnaire is constructed and distributed to a sample of 146 prospects of Aditi Technologies to find the factors associated with higher response rates. The collected data is analyzed using Factor Analysis and the 11 factors, From Line, Subject Line, Personalization of the subject line, Timing...

متن کامل

Analysis of Email Fraud detection using WEKA Tool

—Data mining is also being useful to give solutions for invasion finding and auditing. While data mining has several applications in protection, there are also serious privacy fears. Because of email mining, even inexperienced users can connect data and make responsive associations. Therefore we must to implement the privacy of persons while working on practical data mining. Using K-mean cluste...

متن کامل

Email mining toolkit supporting law enforcement forensic analyses

The Email Mining Toolkit (EMT) is a data mining tool that visualizes a very wide range of detailed analyses of email and email flows derived from an archive of email in a variety of formats. EMT may be leveraged in many applications. In this project we focus on providing support to detectives and analysts in law enforcement to develop powerful means of analyzing emails acquired under due proces...

متن کامل

Email Mining: Emerging Techniques for Email Management

Email has met tremendous popularity over the past few years. People are sending and receiving many messages per day, communicating with partners and friends, or exchanging files and information. Unfortunately, the phenomenon of email overload has grown over the past years becoming a personal headache for users and a financial issue for companies. In this chapter, we will discuss how disciplines...

متن کامل

Your Mark Is My Dirt: Impact of Email Signatures on Decision Making

In order to text mine email data it is important to address the substantial amount of noise usually contained in the data. This noise can skew the results of data mining and so reduce the effectiveness and efficiency of decision support systems that use these techniques. Ideally the noise is removed in pre-processing. The paper presents a case study of a series of steps to progressively clean a...

متن کامل

Comparison of machine learning techniques for handling multicollinearity in big data analytics and high - performance data mining Gerard

§ The insights gained from this study could be useful in selecting machine-learning methods for automated pre-processing of thousands of correlated variables in biomedical data mining. Conclusions Comparison of machine learning techniques for handling multicollinearity in big data analytics and high-performance data mining Gerard G. Dumancas1* and Ghalib Bello2 *1Oklahoma Baptist University, S...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009